Neural Network Classifiers Estimate Bayesian a posteriori Probabilities

نویسندگان

  • Michael D. Richard
  • Richard Lippmann
چکیده

Many neural network classifiers provide outputs which estimate Bayesian a posteriori probabilities. When the estimation is accurate, network outputs can be treated as probabilities and sum to one. Simple proofs show that Bayesian probabilities are estimated when desired network outputs are 2 of M (one output unity, all others zero) and a squarederror or cross-entropy cost function is used. Results of Monte Carlo simulations performed using multilayer perceptron (MLP) networks trained with backpropagation, radial basis function (RBF) networks, and high-order polynomial networks graphically demonstrate that network outputs provide good estimates of Bayesian probabilities. Estimation accuracy depends on network complexity, the amount of training data, and the degree to which training data reflect true likelihood distributions and u priori class probabilities. Interpretation of network outputs as Bayesian probabilities allows outputs from multiple networks to be combined for higher level decision making, simplifies creation of rejection thresholds, makes it possible to compensate for differences between pattern class probabilities in training and test data, allows outputs to be used to minimize alternative risk functions, and suggests alternative measures of network performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Introduction to Inference and Learning in Bayesian Networks

Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...

متن کامل

Estimates of constrained multi-class a posteriori probabilities in time series problems with neural - Neural Networks, 1999. IJCNN '99. International Joint Conference on

In time series problems, where time ordering is a crucial issue, the use of Partial Liklihood Estimation (PLE) represents a specially suitable method for the estimation of parameters in the model. We propose a new general supervised neural network algorithm, Joint Network and Data Density Estimation (XWDE), that employs PLE to approximate conditional probability density finctions for multi-clas...

متن کامل

Nonlinear Discriminant Features Constructed by Using Outputs of Multilayer Perceptron

This paper 1 proposes a method to extract nonlinear discriminant features from given input measurements by using outputs of multilayer Perceptron (MLP). Linear Discriminant Analysis (LDA) is one of the best known methods to construct linear features which are suitable for class discrimination. Otsu showed that LDA can be extended to nonlinear if we can estimate Bayesian a posteriori probabiliti...

متن کامل

Equivalence Proofs for Multi-Layer Perceptron Classifiers and the Bayesian Discriminant Function

This paper presents a number of proofs that equate the outputs of a Multi-Layer Perceptron (MLP) classifier and the optimal Bayesian discriminant function for asymptotically large sets of statistically independent training samples. Two broad classes of objective functions are shown to yield Bayesian discriminant performance. The first class are “reasonable error measures,” which achieve Bayesia...

متن کامل

A Critical Overview of Neural Network Pattern Classifiers*

A taxonomy of neural network pattern classifiers is presented which includes four major groupings. Global discriminant classifiers use sigmoid or polynoniial computing elements that have “high” non-zero outputs over most of their input space. Local discriminant classifiers use Gaussian or other localized computing elements that have Lbhigli” non-zero outputs over only a small localized region o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Neural Computation

دوره 3  شماره 

صفحات  -

تاریخ انتشار 1991